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Abstract. In this short paper, we overview and extend the results of our papers [1, 2, 3], where we 
use an analogy with statistical physics to describe probability distributions of money, income, and 
wealth in society. By making a detailed quantitative comparison with the available statistical data, 
we show that these distributions are described by simple exponential and power-law functions. 



The equilibrium statistical mechanics is based on the Boltzmann-Gibbs law, which 
states that the probability distribution function (PDF) of energy e is P(e) = Ce~ £ l T , 
where T is the temperature, and C is a normalizing constant. The main ingredient in the 
textbook derivation of the Boltzmann-Gibbs law is conservation of energy. Similarly, 
when two economic agents make a transaction, some amount of money is transferred 
from one agent to another, but the sum of their monies before and after transaction is 
the same: m l +m 2 = m\ + m' 2 . So, money is locally conserved in interactions between 
agents. Then, by analogy with statistical physics, one may expect that the equilibrium 
PDF of money m in a closed system of agents has the Boltzmann-Gibbs form P(m) = 
e -m/T jj ^ yyhgj-g j i s e ff ec tive "money temperature" equal to the average amount 
of money per agent. This conjecture was confirmed in computer simulations of various 
simple models of money exchange in Ref. [1] under quite general conditions of the 
time-reversal symmetry and sharp boundary conditions at the lower end of m. In a more 
general case, where the time-reversal symmetry is broken or debt is permitted [4], the 
probability distribution of money may deviate from the Boltzmann-Gibbs law. A popular 
review of these models can be found in Ref. [5]. 

It would be very interesting to compare these results with the actual PDF of money 
in the society. Unfortunately, we were not able to find such data. On the other hand, we 
found a lot of statistical data for the PDF of income r. In Fig. 1, we show the IRS data 
for the distribution of individual income in USA in 1997 [6]. The left panel shows the 
cumulative PDF N(r) = P(r ; ) dr' , which gives the fraction of individuals with income 
greater than r. The main panel shows the data in the log-log scale, and the inset in the 
log-linear scale. The straight line in the inset demonstrates that, for incomes below 100 
k$/year, the income PDF has the exponential Boltzmann-Gibbs form P,(r) = e~ r / R /R, 
where R is the effective "income temperature" equal to the average income. On the other 
hand, for very high incomes above 100 k$/year, the PDF changes to the Pareto power 
law, as illustrated by the straight line in the log-log scale. The fraction of population in 
the power-law tail is very small, less than 3%. So, the income distribution of the great 
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FIGURE 1. Left panel: Cumulative probability distribution of individual income in USA in 1997, 
shown in log-log scale (main panel) and log-linear scale (inset). The points are the raw data from IRS, and 
the solid lines are the fits to exponential and power-law functions. Right panel: Lorenz plot of the same 
data points, compared with function (2) shown by the solid line. 



majority of population is described by the exponential Boltzmann-Gibbs law. 

Another standard way of representing income distribution is the so-called Lorenz 
plot shown in the right panel of Fig. 1. The horizontal axis of the Lorenz curve, x(r), 
represents the cumulative fraction of population with incomes below r, and the vertical 
axis y(r) represents the fraction of the total income this population accounts for: 

As r changes from to <», x and y change from to 1, and Eq. (1) parametrically defines 
the Lorenz curve in the (x,y) space. The diagonal line y = x represents the Lorenz curve 
in the case where all population has equal income. The inequality of the actual income 
distribution is measured by the Gini coefficient < G < 1, which is the area between 
the diagonal and the Lorenz curve, normalized to the area of the triangle beneath the 
diagonal: G = 2 Jq(x — y) dx. 

For the exponential PDF P^r), the Lorenz curve is y = x+ (1 — x)ln(l — x), and the 
Gini coefficient is Gj = 1 /2 [2]. As shown in Fig. 3 of Ref. [2], G 1 = 1/2 is in overall 
good agreement with the Gini coefficient given by IRS for the last 20 years. However, 
because the PDF shown in the left panel of Fig. 1 is a mix of exponential and power-law 
functions, the Lorenz curve is modified as follows: 

y = (1 -f)[x+ (1 -x)ln(l -x)} +fS(l -x). (2) 

In Eq. (2), the weight of the first term is reduced by 1 — /, because the normalization 
factor for y in Eq. (1) differs from the purely exponential case. The remaining weight / 
is the fraction of the total income contained in the power-law tail in excess of the expo- 
nential law. Because the fraction of population in the tail is very small, this contribution 
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FIGURE 2. Left panel: The histogram of the family income PDF for the families with two adults in 
1996 [2, 8] and its fit to Eq. (3). Right panel: The incomes of spouses (r l , r 2 ) and (r 2 ,r l ) for the families 
with two earners in 1999 [9]. 



is approximated by the delta-function in Eq. (2). Thus, the last term in Eq. (2) can be 
called the "Bose condensate". Our definition of the "Bose condensate" differs from Ref. 
[7], where it was associated with the case, where the integral y(r) diverges at the upper 
limit, and almost all income is concentrated at the upper end. We have never encountered 
such a situation in the data. We find that y(r) always converges, and the "Bose conden- 
sate" fraction / has a modest value. The right panel of Fig. 1 demonstrates that formula 
(2) agrees very well with the data, giving / = 16% for 1997. However, Fig. 3 in Ref. [2] 
shows that / monotonously increases in time for the last 20 years. Wealth distribution is 
also described by formula (2) with / = 16% [3]. 

Now let us discuss the distribution of income for families with two earners. The family 
income r is the sum of two individual incomes: r = r l +r 2 . Assuming that the individual 
incomes r x and r 2 are uncorrected and have exponential distributions, the family income 
PDF P 2 (r) is given by the convolution 

*2 W = f Q p i ( r ') p i ( r ~ r ') dr ' = ]p e ~ r/R - (3) 

As shown in the left panel of Fig. 2, Eq. (3) is in excellent agreement with the data. 
The points in the right panel show (r l5 r 2 ) and (r 2 , for the families in the data set. It 
demonstrates that there is no significant correlation between incomes of spouses. 

For the PDF P 2 (r) (3), the Lorenz curve was calculated in Ref. [2], and the Gini co- 
efficient was found to be G 2 = 3/8 = 37.5%. The left panel in Fig. 3 demonstrates that 
these theoretical results are in excellent agreement with the data for the last 50 years. 
The right panel shows the World Bank data [11] for the average values of the Gini co- 
efficient in different regions of the world. For the well developed market economies of 
West Europe and North America, the Gini coefficient is very close to the calculated value 
37.5% and does not change in time. In other regions of the world, the income inequality 
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FIGURE 3. Left panel: Lorenz plot calculated for the family income PDF (3), compared with the US 
Census data points for families during 1947-1994 [10]. Inset: The US Census data points [10] for the 
Gini coefficient for families, compared with the theoretically calculated value 3/8=37.5%. Right panel: 
Gini coefficients for households across the globe for two different years, 1988 and 1993 [11]. 



is higher. The special case is the former Soviet Union and East Europe, where inequality 
was lower before the fall of communism and greatly increased afterwards. In statistical 
physics, the exponential Boltzmann-Gibbs distribution is the equilibrium one, because 
it maximizes the entropy of the system. By analogy, we argue that the equilibrium dis- 
tribution of individual income in society is also described by the Boltzmann-Gibbs law, 
and the equilibrium inequality is characterized by the Gini coefficients G 1 = 1/2 for in- 
dividual income and G 2 = 3/8 for family income. Fig. 3 shows that such an equilibrium 
state of maximal entropy has been achieved in developed capitalist countries. 
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